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Abstract. We prove that the isomorphism of scattered tree automatic linear orders as well 
as the existence of automorphisms of scattered word automatic linear orders are undecidable. 
. For the existence of automatic automorphisms of word automatic linear orders, we determine 

£\j ' the exact level of undecidability in the arithmetical hierarchy. 
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1 Introduction 
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Automatic structures form a class of computable structures with much better algorithmic prop- 
erties: while, due to Rice's theorem, nothing is decidable about a computable structure (given as 
a tuple of Turing machines), validity of first-order sentences is decidable in automatic structures 
' (given as a tuple of finite automata). This property of automatic structures was first observed and 

exploited in concrete settings by Biichi, by Elgot [TT], and by Epstein et al. [T2]. Hodgson [15] 
attempted a uniform treatment, but the systematic study really started with the work by Khous- 
sainov and Nerode [IB] and by Blumensath and Gradel [314] . Over the last decade, a fair amount 
of results have been obtained, see e.g. the surveys |28llj as well as the list of open questions [T5] , 
for very recent results not covered by the mentioned articles, see e.g. [5I10I17IIB] . 

A rather basic question about two automatic structures is whether they are isomorphic. For 
ordinals and Boolean algebras, this problem was shown to be decidable via a characterisation of 
' the automatic members of these classes of structures. On the other hand, already Blumensath 

and Gradel [4] observed that this problem is undecidable in general. In [20], it is shown that the 
isomorphism problem is E\ -complete; a direct interpretation yields the same result for successor 
trees, for undirected graphs, for commutative monoids, for partial orders (of height 2), for lattices 
(of height 4) [26]. Rubin [27] shows that the isomorphism problem for locally finite graphs is 
complete for TJg . In [23] , we show in particular that also the isomorphism problems of order trees 
and of linear orders are Z^-complete. For the handling of linear orders, our arguments rely heavily 
on "shuffle sums" . Consequently, we construct linear orders that contain a copy of the rational 
line (a linear order not containing the rational line is called scattered, i.e., our result is show for 
non-scattered linear orders). This is unavoidable since we also show that the isomorphism problem 
for scattered linear orders is reducible to true arithmetic (i.e., the first-order theory of (N, +, •)) 
and therefore much "simpler" than the isomorphism problem for arbitrary linear orders. But it is 
still conceivable that the isomorphism problem for scattered linear orders is decidable. 

In this paper, we deal with automatic scattered linear orders. In particular, we prove the 
following three results: 

(1) There is a scattered linear order whose set of tree-automatic presentations is TTj'-hard (i.e. one 
can reduce the complement of the halting problem to this problem) . This holds even if we fix 
the order relation on the set of all trees (Theorem [5]) . Hence also the isomorphism problem 
for tree automatic scattered linear orders is Tl^-hard (Corollary [2]). 

(2) The existence of a non-trivial automorphism of an automatic scattered linear order is JC^-hard 
(i.e. the halting problem reduces to this problem, Corollary Q]). Again, this holds even if we fix 
the linear order on the set of all words (Theorem [5]). The existence of an automatic non-trivial 
automorphism is Z'J-complete. 

For regular languages ordered lexicographically, the existence of a non-trivial automorphism 
is decidable (Theorem [I}, but it becomes undecidable for contextfree languages (Theorem [3]). 

(3) The existence of a non-trivial automorphism of a tree automatic scattered linear order is E®- 
hard (i.e., one can reduce the set of Turing machines that accept a finite language to this 
problem, Theorem [6]). 



The proof of (2) uses an encoding of polynomials similarly to [53] but avoids the use of shuffle 
sums. The technique for proving (1) and (3) is genuinely new: One can understand a weighted 
automaton over the semiring (NU { — oo}; max, +) as a classical automaton with a partition of the 
set of transitions into two sets To and T\. The behavior of such a weighted automaton assigns 
numbers to words w, namely the maximal number of transitions from T\ in an accepting run on 
the word w. Krob [22] showed that the equivalence problem for such weighted automata is -/In- 
complete. The hardness results from (1) are based on a sharpening of Krob's result (see [5]): there 
is a fixed weighted automata such that the set of equivalent weighted automata is iT^-hard (and 
therefore undecidable) . A closer analysis of this proof, together with the techniques for proving 
(1) and (2), finally yields (3). 

These results show that the existence of isomorphisms and of automorphisms is nontrivial for 
scattered linear orders that are described by word and tree automata, resp. 

2 Preliminaries 

2.1 Tree and word automatic structures 

Let S be some alphabet. A E-tree or just a tree is a partial mapping t : {0, 1}* — > E such that 
uv G dom(i) implies u G dom(i), and ul 6 dom(t) implies m0 G dom(i) (note that we allow the 
empty tree with dom(0) = 0). A (bottom up) tree automaton is a tuple A — (Q, i, A, F) where Q 
is a finite set of states, l is the initial state, A C Q x E x Q 2 is the transition relation, and F C Q 
is the set of final states. A run of the tree automaton A on the tree t is a mapping p : dom(i) — > Q 
such that 



holds for all u G dom(i). The run p is accepting if p{e) G F . The language of the tree automaton A 
is the set L(A) of all trees t that admit an accepting run of A on t. A set L of trees is regular if 
there exists a tree automaton A with L(A) = L. 

It is convenient to understand a word as a tree t with dom(t) C 0* (then t(e) is the first letter 
of the word). Nevertheless, we will use standard notation for words like uv for the concatenation 
or e for the empty word. A word automaton is a tree automaton A = (Q, t, A, F) with 



This condition ensures that word automata accept words, only. 

Let ti, . . . , t n be trees and let # ^ S. Then U# = Z'U{#} and the convolution <2, ■ ■ ■ , t n ) 
or t\ (g> <2 ® • • • (8 in is the 17^-tree i with dom(i) = Ui<;<n dom(t) and 



Note that the convolution of a tuple of words is a word, again. For an n-ary relation R on the set 
of all trees, we write i?® for the set of convolutions ®(ti, . . . , t„) with (ti, . . . , t n ) € R. A relation 
i? on trees is automatic if i?® is a regular tree language. 

A relational structure S = (L; R\, . . . , i?^) is tree automatic if the tree languages L and i?® 
for 1 < i < k are regular; it is word automatic if, in addition, I is a word language. A tuple of 
tree automata accepting L and Rf for I < i < k is called a tree or word automatic presentation 
of the structure S. 

2.2 Linear orders 

For words u and u, we write u < pre f u if u is a prefix of v. Let be some set linearly ordered 
by <. Then <i cx denotes the lexicographic order on the set of words E*: u <i cx v if u < prc f v or 




for v G dom(t) 
otherwise 





there are x, y, z 6 S* , a, b G S with u = xay, v = xbz, and a < b. From the lexicographic order 
on U*, we derive a linear order (denoted <f ox ) on the set S* (g) S* of convolutions of words by 

»®d <f cx u' ®v':-^u <i cx u' or u — u' , v <i cx v' . 

By <iioxj we denote the length-lexicographic order defined by u <n cx v if |u| < |i>| or \u\ = \v\ 
and u <ie X u. We next extend this linear order <n cx to trees. Let t be a tree. Then t\o* (more 
precisely, t f(o*ndom(t))) is a word that can be understood as the "main branch" of the tree t. For 
u E {0,1}*, let t\ u denote the subtree of t rooted at u (i.e., Aouv{t\ u ) — {v \ uv G dom(i)} and 
t\u{v) = t(uv) for u G {0, 1}* as well as t\ u = fy for u ^ dom(i)). Furthermore, r(t) is the tuple of 
"side trees" of t, namely 

r (*) — \t ro i i)o i edom(t) ■ 
We now define the extension <trocs of <n cx to trees setting s <trccs t if and only if 

— s is the empty tree or 

— s\o* <iicx t\ * or 

— s\q* = ifo* and there exists i (with 0* € dom(s)) such that s\ ii = t\ ii for all < j < i and 
sfou <trccs tlon- 

In other words, we first compare the main branches of the trees s and t length-lexicographically 
and, if they are equal, compare the tuples r(s) and r(t) (length-)lexicographically (based on the 
extension < trccs of the length-lexicographic order to trees). Since the "side trees" t fon of any tree t 
are properly smaller than the tree itself, the relation <t re es is well-defined. Note that all the order 
relations < prc f, <i cx , <f cx , <n ox , and < tr ccs are automatic. 

A linear order C is scattered if there is no embedding of the rational line (Q; <) into C. Examples 
of scattered linear orders are the linear order of the non- negative integers oj, of the non-positive 
integers oj* , or the linear order of size n £ N that we denote n. If E is an alphabet with at least 2 
letters, then (£*; <iiex) — u is scattered, too. On the other hand, if a, b € S are distinct letters, 
then ({aa, bb}*ab; <i cx ) = (Q;<). Hence (£*;<i cx ) is not scattered. From [HJ Prop. 4.10], we 
know that the set of word automatic presentations of scattered linear orders is decidable. 

A linear order C = (L; <) is rigid if it does not admit any non-trivial automorphism, i.e., if 
the identity mapping f : L — > L : x ^ x is the only automorphism of C. The linear orders uj, uj* , 
and n for n G N are all rigid. On the other hand, (Q; <) as well as (Z; <) are not rigid. 

Note that automorphisms of tree automatic linear orders are binary relations on E* . Hence 
it makes sense to speak of an automatic automorphism. An automatic structure is automatically 
rigid if it does not have any non-trivial automatic automorphisms. 

Let X — (I; <) be a linear order and, for i G I, let d — (Li; <i) be a linear order. Then the 
£-su of these linear orders is defined by 



»£(/;<) 

For J2ie2 £»i we simply write C\ + £2- If, for all i G I, Ci = C, then we write C-I for X^e(/ <) 
Note that £ -X is obtained by replacing every element of I by a copy of L. As an example, consider 
the linear order S — uj ■ uj* . This linear order will be used as "delimiter" in our constructions. It is 
isomorphic to (N x N; <s) with 

<8 (k,i) :<^> j > I or j — I and i < k . 

1 Shuffle sums mentioned in the introduction are special cases of this construction where X = (Q; <) is 
the rational line and, for every q £ Q, the set {r € Q | C q = C r } is dense. 
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Hence it forms a descending chain of ascending chains. Therefore, it has no minimal and no 
maximal element, is rigid and scattered. Note that 

6= (10+1+0; < lex ) 

where we assume < 1. The isomorphism is given by i— > 1CF+ 1*+ 0. 

Also for later use, we next define a regular set D = {ti_j \ i,j > 0} of trees such that S = 
(D; <trees)- The alphabet of these trees will be the singleton {$} so that a tree is completely given 
by its domain. Then set inductively 

domOoj) = {e,0,00}Ul{0 fe | < k < j} and 
dom(t i+ ij) = {e, 0, 00} U 01 dom(t itj ) 

The trees to, 4 and £2,2 are depicted in Figure [T] (left-arrows denote 0-sons, right-arrows denote 
1-sons). The tuple r(tij) has the following form 

T(t ,j) = (^,0,0) with dom(ij) = {0 k | < k < j} and t(U +1j ) = (0,^,0) . 

Note that all trees t%j coincide on their main branch, i.e., tij \q* = tk,e\o*. Hence <trccs tk,i if 
and only if r(ti,j) is lexicographically smaller than r(tk^)- But this is the case if and only if 

— — k < i or 

— = i = k and j < £ or 

— < i, k and U-ij <trees tk-l,l- 

By induction, this is equivalent to i > k or i — k, j < t. Hence (D\ <trecs) — S. 



Fig. 1. Two trees from D 



3 Automorphisms of linear orders on words 

In this section, we consider linear orders on sets of words. The universe will be regular or contextfree 
and the order will mainly be the lexicographic order <i cx and its relative <^ ex . 

3.1 Regular universe and <i ex 

Courcelle [7] initiated the study of regular words, i.e., labeled linear orders derived from frontiers 
of regular trees. Thomas proved that the isomorphism problem for these words is decidable (29] , 
the complexity of this problem was determined by Lohrey and Mathissen [M] . 

Based on techniques and results from [2], we will show that, given a regular language L, it 
is decidable whether (L; <i ex ) is rigid. This proof requires the consideration of regular words: 
An extended word is a labeled linear order with a finite set of labels. A regular word over the 
alphabet A is an extended word (L; <, A) with A : L — > A such that 



— L and A 1 (a) for a <E A are regular subsets of S* and 

— < is the lexicographic linear order <i cx . 

Regular words can be described by terms over A that we define next. These terms use constants 
a 6 A (standing for the extended word on 1 whose only element is labeled o) and the following 
operations: 

— concatenation of words (denoted /i + v) 

— w-power (denoted /i • lj) 

— w*-power (denoted /i • lj*) 

— shuffle (denoted [i>i, ^2, ■ • ■ , ^k} 71 ) for arbitrary k > 1. 

The semantics of the concatenation, w-power and w*-power generalize the corresponding operations 
for linear orders in the obvious way. To dehne the extended word [v\, . . . , fk] 71 , let A : Q — > 
{1, 2, . . . , k} be a mapping such that A _1 (i) is dense for all 1 < i < k. Then set v(q) = v\( q ) for 
q E Q and define 

[v u ...,v k }*> = "(?) 

9G(Q;<) 

as we did for linear orders. For a term t, let \t\ denote the extended word it describes. 

Let v = (L; <\ cx , A) be a regular word given by finite automata that accept L and A -1 (a) for 
a e A (without loss of generality, we can assume e £ L). Let Pref(L) C S* denote the set of 
proper prefixes of words from L. Then T = (Prcf(L) U {w$a | wa 6 L}; < pre f) is a regular tree 
whose leaves are of the form w$a for too £ L. Let $ be the least letter of U U {$}. Then we can 
recover v by reading the leaves of the tree from left to right and label them by their last letter. 
From this regular tree, we can read off a system of equations as follows: Let Ti, T2, . . . , T n denote 
the subtrees of T (up to isomorphism) with T = T\. We have n variables x%, X2, ■ ■ ■ , x n and the 
system of the following equations: if Ti is not a singleton, then we have the equation 

X% X-i^ X22 • ■ • Xj,^ 

where , Ti 2 . . . Ti k are the subtrees of Ti rooted at the children of the root. If T,; is a singleton, 
and if its only node is labeled a £ A, then we have the equation 

Xi = a . 

Then the regular word v is the initial solution (in the sense of [7]) of this system of equations. 

From this system of equations, one can compute a term t with |t| = v (Heilbrunner |14|). Thus, 
to decide whether v has a nontrivial automorphism, we have to be able to decide, given a term t, 
whether \t\ has a nontrivial automorphism. 

Let v — (L; <, A) be an extended word. On the set L, we define an equivalence relation ~ by 
x ~ y if (where we assume x < y) 

— the interval [x, y] is finite or 

— for any x\ y' ', z e [x, y] with x' < y', there exists z' S {x 1 , y') with A(z) = A(z'). 

Bloom and Esik [2] define a (decidable) class of terms D(A) (called primitive terms in normal 
form) with the following properties 

— If v is a regular word with a single ^-equivalence class, then there exists a term t £ D(A) with 

1*1- 

— If t £ D(A), then \t\ has a single ^-equivalence class. 

— If s,t £ D(A) with \s\ = |t|, then s = t. 

Let v = (L; <i ox , A) be a regular word. The equivalence classes with respect to ~ are convex 
sets. Hence they can be ordered by 

[x]^ <' [y]^ :<=> x < y and i/t/ 



such that (L/~; <') is a linear order. For X £ L/~, the restriction of v to the equivalence class X 
is a regular word with a single ^-equivalence class. Hence there exists a unique term tx € -D(A) 
with |i| = z/fX. Define A' : Lj > D(A) by X ^ t X - Then 

c{v) = {L/~-<',\') 

is an extended word with possibly infinite alphabet. 

To decide whether \t\ is rigid, we proceed as follows: Using the algorithm by Bloom and Esik 
[2J, we construct a term c(t) with \c(t)\ = c(\t\), in particular, c(\t\) has a finite alphabet. From 
this term c(t), we can extract the set D of terms from D(A) that appear in c(t). Then we observe 
that \t\ has a nontrivial automorphism if and only if 

— c(\t\) = \c(t)\ has a nontrivial automorphism or 

— there exists a ^-equivalence class X such that \t\\X has a non-trivial automorphism. 

Note that s G 13 if and only if there exists a ~-equi valence class X with \t\\X = |s|. Hence the 
second item holds if and only if there exists s € D such that \s\ has a nontrivial automorphism - 
but this is the case if and only if s is of the form u ■ to* + v ■ to or [m, . ■ ■ , Uk] v . To decide whether 
\c(t)\ has a nontrivial automorphism, we call this process recursively. From [21], we observe that 
c™(|t|) is a singleton for some n G N, hence this recursive procedure stops eventually with \t\ a 
singleton. 

Formulated for linear orders, we therefore showed 
Theorem 1. The set of regular languages L such that (L; <i ex ) is rigid, is decidable. 

3.2 Regular universe and <f ex 

The situation changes completely when we move from the lexicographic order <i cx to the linear 
order <j 2 cx since, as we will see, rigidity of (L; <f ex ) is undecidable for regular languages L. 

Let p,q 6 be two polynomials with coefficients in N and variables among x — (xi, . . . , Xk)- 
Then define the linear order 

E (jp(x) + S) ■ lj* + (q(x) + S) ■ u)) . 

2£(N fc ;<i cx ) 

This linear order forms an w-sequence of "blocks" of the form 

B{m, n) = (m + 5) ■ uj* + (n + 5) ■ uj 

with to, n G N. Therefore, every automorphism of has to map every block onto itself. In other 
words, Cp^q is rigid if and only if all these blocks are rigid. But B(m,n) is rigid if and only if 
to =^= n. Hence we showed 

C P) q is rigid Wx G N fc : p(x) ^ q(x) . (1) 

Finally note that C PA is scattered since 5, u>, and ui* are all scattered. 

We now prove that C PtQ is automatic or, more specifically, we will construct a regular set 
L C {0, 1}+ ® {0, 1}+ such that C p . q S (L; <2 ox ) (see Lemma H below). 

Let A = (Q, l, A, F) be a word automaton over the alphabet £ and let w G S + be a word. 
Then Run(„4, w) is the set of all words over A of the form 

(q ,a 1 ,q 1 ,L)(q 1 ,a 2 ,q2,i-) ■ ■ ■ (qk-l, ajfc, t, i) 

with it) = aia2 . . . afc and qo £ f - These words encode the accepting runs of the word automaton A 
(recall that word automata are special bottom up tree automata which explains the unusual 
position of the initial and final states in the run). Furthermore, let Run(_4) = {J wGS + Run(,A, w). 



Lemma 1. From polynomials p,q € N[xi, . . . , X&], one can construct an alphabet S and a regular 
language K C E + ® S + such that (K; <f cx ) = £ P , q - 

If C Pi g has a non-trivial automorphism, (K; <i ex ) has a non-trivial automatic automorphism. 

Proof. Let p and q be polynomials from N[xi, . . . , Xk]. For x = (x%, . . . , Xk) S N fe , set 

a x = a Xl $a X2 $---$a Xk $ e (a*$ k . 

Then, as in the proof of [231 Lemma 7] , one can construct nondctcrministic finite automata A p = 
(Qp,Lp,A p ,F p ) and A q = (Q q ,i q ,A q ,F q ) with L(A p ),L(A q ) C (a*<;) fc , such that, for x e N fe , the 
NFA A p has precisely p(x) many accepting runs on the word a x , i.e., |Run(^ p , a x )\ = p(x), and 
similarly \Run(A q ,a x )\ = q(x). We will assume A p C\ A q — 0. 
Define the language K by 

K= |J a £ 0+l «> (Run(^ p , a 5 ) U 32+3+2) 
U |J a 2 l+0® (Run(^, a 5 ) U 32+3+2). 

Hence any word from K is the convolution of two words over the alphabet 

27 = {a,<:,0,l,2,3}U4pLM g . 

We have to show that the language K is effectively regular. Here, the crucial point is the 
regularity of 



[J a x + l ®Run(Ap,a x ) 



(J a x ®Run(A p ,a x ) 

26N fc 



(0+1® {e}) 



(this equality holds since \w\ = \ W\ for any w € (a*<;) fe and £ Run(„4 p , w)). But a word belongs 
to the language in square brackets if and only if it is the convolution of a word w from the regular 
language (a*<$) k and a run of the automaton A p on this word w, a property that a finite automaton 
can check easily. 

On the alphabet E, we now fix a linear order < such that 

Z\ p UZ\ 9 <0<l<2<3<<;<a. 

The associated order <[ 2 CX on the language K can now be characterized as follows: 

a x b m (l - b) <g> r <i 2 ox a s c"(l - c) <g> s 

(with 6, c £ {0, 1}) if and only if 

(i) b = 0, c = 1 and x <i ox y, or 

(ii) 6=1, c = and x <i cx y, or 

(iii) b = c and 

(111.1) x <i GX y, or 

(111. 2) x = y, b = 0, and m > n, or 

(111. 3) x = y, 6 = 1, and m < n, or 

(111. 4) x = y, m = n, and 

(111. 4.1) r G Run(.4 p ) URun(„4 9 ) and s E 32+3+2, or 

(111. 4. 2) r, s G Run(^l p ) U Run(^4 q ) and r <i ox s, or 

(111.4.3) r, s € 32+3+2 and r < lox s. 



We show (K; <f cx ) = C- P , q - For x £ N k and m > 1, let Zr,o m i denote the restriction of (K; <f ex ) 
to the set a s O m l ® (Run(„4 p , a 2 ) U 32+3+2) C if. By (iii.4.1), Xg,o«H is isomorphic to the sum 
of the restrictions of (K; <l x ) to the sets a s O m l <g> Run(„4 p ,a s ) and a s O m l <8> 32+3+2, resp. By 
(iii.4.2) and the choice of the automaton A p , the first restriction is isomorphic to p(x). Recall that 
(32+3+2; <i ex ) — S. Hence, the second restriction is isomorphic to S by (iii.4.3). In summary, 

2x,o m i = P(x) + $■ 

Next, let X sfi+1 denote the restriction of (K; <f cx ) to the set a 2 0+l ® (Run(„4 p , a 2 ) U 32+3+2). 
Note that, by (hi. 2), we have 



2^,0 m+1 l <lex 3-x,o 



for all to > 1. Hence 



Zx,o+i = ^ Zg,o-™i - + 8) -v* 



With X Sil+0 the restriction of (K; <j 2 cx ) to the set a s l+0 ® (Run(A?, a s ) U 32+3+2), we obtain 
similarly 

^x.i+o = X! ^ lm ° - ( g(^) + <5) ' ^ 

m>l 

(the reason for the factor co instead of u* above is the difference between (hi. 2) and (hi. 3)). 
Finally, for x £ N fe , let X s denote the restriction of (K; <f ex ) to the set 



a s 0+l ® (Run(^ p , a 5 ) U 32+3+2) 



U 



a s l+0 ® (Run(^ 9 , a 5 ) U 32+3+2) 



Then (i) with x = y and the above imply 

Xr = 2^0+1 +^x,l+0 - ( g(g) + <*) ' w * + ( g(g) + <*) ' w • 

Together with (i), (ii), and (iii.l), this ensures 

(K; <L) = E ^ = 4>,« • 

SG(N fc ;< lox ) 

Now suppose that £ Pjg has a non-trivial automorphism. Then, as we saw above, there is y £ N k 
such that p(y) = q(y). From the construction of the automata A p and A q , we infer |Run(^ p , cfi)\ = 
\Rwn(A q ,aP)\. Let 



Run(„4 p , a v ) = {pi, ...,p n ) and Run(„4 g , a v ) = {ai, ...,a n ) 



with 



Pi <lcx Pi <lcx ' ' ' <lcx Pn and 1 <l x <J2 <lox 1 ' 1 <lcx On • 

Now dehne a mapping / : K — > K by 



/(a s 6 m (l-&)®r) = < 



a ffi 6 m (l - b) ®r ifx^y 

a s 6 m_1 (l - 6) ®r if x = y, & = 0, m > 1 

a^l0<g>r if x = y,b = 0, m = l,r £ 32+3+2 

a^lO ® CTi if x = y, 6 = 0, m = 1, r = pi 

a yb m+1 {l-b)®r ifx = y,fc = l 



This mapping hxes all elements of K not belonging to Xg. On this linear order Xg, it acts as an 
automorphism. Hence / is a non-trivial automorphism of (K; <u ex ). Note that the universe of Xg 
is regular. It follows that /® is regular. 



Lemma 2. From polynomials p,q G N[xi, . . . , Xk], one can construct a regular language L C 
{0, 1}+ <g> {0, 1}+ such that (L; <f cx ) £ p , q . 

If C p _ q has a non-trivial automorphism, then there exists a non-trivial automorphism h of 
(L; <i ex ) such that h® is regular. 

Proof. Let p,q £ N[xi, . . . , Xk] be polynomials, let K be the language from Lemma U and let 
(£; <) be the sequence 

(Tl < (72 < • • • < at . 

Furthermore, let g denote the monoid homomorphism from S* to {0, 1}* defined by g{o~i) — ¥0 e ~ l 
for 1 < i < i. Now set L = {g(u) (g> g(v) \ u (£) v E K}. Then g is an isomorphism from (K; <f cx ) 
onto (L; <f cx ). Since all the words g(ai) have the same length, the language L is also regular. 

If Cp^ q has a non-trivial automorphism, then, by Lemma [TJ there is a non-trivial automor- 
phism / of (K; <u ex ) such that /® is regular. Hence g = h^ 1 o f o h is a non-trivial automorphism 
of (L; <n cx ). Note that h® is regular. It follows that also g® is regular. 

Theorem 2. (i) The set of regular languages L C {0, 1} + ® {0, 1} + smc/i that (L; <\ BX ) is rigid 
(is rigid and scattered, resp.), is Tl^-hard. 
(ii) The set of regular languages L C {0, 1} + ® {0, 1} + such that (L; <i 2 ox ) is automatically rigid 
(automatically rigid and scattered, resp.) is Tli-hard. 

Proof. (i) The set of pairs of polynomials p,q € N[x] with p(y) ^ q(y) for all y £ N fe is In- 
complete [25]. We reduce this to the first set in question: Let p,q £ N[x] and let L be the 
regular language from Lemma [2 Then (L; <f ex ) = C P , q is rigid if and only if p(y) ^ q{y) for 
all y G N fc by Q. 

Note that this is even a reduction to the second set in question since the linear order C p ^ q is 
scattered. 

(ii) Be Lemma [5J is rigid if and only if (L; <i 2 lcx ) is automatically rigid. Hence the above 
reduction also proves the two claims from (ii). 

Corollary 1. (i) The set of word automatic presentations of rigid (rigid and scattered, resp.) 
linear orders is Tl^-hard. 
(ii) The set of word automatic presentations of automatically rigid (automatically rigid and scat- 
tered, resp.) linear orders is TI^ -complete. 

Proof. The two claims from (i) are obvious consequences of Theorem dfi). Analogously, the two 
hardness claims from (ii) follow immediately from Theorem HJii). 

Now let (L; <) be an automatic linear order given by a word automatic presentation. Let 
R C S + x S + . Then it can be expressed in first-order logic that R is a non-trivial automorphism 
of (L;<). Hence, given a finite automaton A for a regular language R® C S + (gi E + , one can 
decide whether R is a non-trivial automorphism of (L; <) |18j . Consequently, automatic rigidity 
of (L; <) is a TT^-property. 

3.3 Contextfree universe and <i ex 

Esik initiated the investigation of linear orders of the form (L; <i ex ) where L is contextfree. Density 
of such a linear order is undecidable [13] , the isomorphism problem is i^-complete |23j and their 
rank is bounded by [B]. 

We will show that rigidity of (L; <i cx ) is undecidable for context-free languages L. The proof 
uses the linear order C PtQ and constructs a deterministic context-free language L' such that (L'\ <i ox 
) — C"p.q- This construction is a variant of the construction in the proof of Lemma [TJ 

Lemma 3. From polynomials p,q G N[xi, . . . ,Xk], one can construct a deterministic contextfree 
language L' C {0, 1} + such that (L'; <i ex ) — £-p,q- 



Proof. Let p,q G N[xi, . . . ,Xk] be polynomials and let K be the language from Lemma [T] Then 
set 

K' = {u$v rev | u <g> « e L} 

where v 1 * 611 is the reversal of the word v. Then, from a deterministic finite automaton A accept- 
ing K rev , one can construct a deterministic pushdown automaton accepting K' (reading u$v, it 
stores u in the stack and, after reading $, simulates A while emptying the stack). Note that the 
alphabet of K' is 

E' = {$} U E = {$, a, 0, 1, 2, 3} U A p U A q . 
We order the alphabet E' by <' such that 

A p U A? <' <' K' 3 <' 2 <' <: <' a <' $ . 

Compared to the proof of Lcmma[TJ the order of 2 and 3 is inverted and $ is made the new maximal 
element (we could have placed $ anywhere) . With < the order on E from the proof of Lemma [1] 
one effect of this definition is (32+3+2''™; <{ ox ) S (32+3+2; <i ex ) = 5 which will be used below. 

To show (K';<[ ex ) = C Ptq , is suffices to prove (K';<[ ex ) = {K;<f cx ). For this, recall that 
(K; <f ex ) is a sequence of the following blocks (for x G N k and m > 1): 

— (a x m l <E) Run(^l p , a x ); <? ex ): This linear order is finite of size |Run(^l p , a x )\. The same holds 
of the linear order 

(a x m l${r rev | r G Run(^ p , a x )}; <[ cx ) . 

— (a x l m <E) Run(^4 g , a x ); <f ex ): As above, this is isomorphic to 

(a x l m 0${r rev | r G Run(^,a £ )}; <( cx ) . 

— (a x b m (l — b) (S> 32+3+2; <^ ox ) (for b G {0, f }) which is isomorphic to S. But S is also isomorphic 
to 

(a 2 6 m (f-6)$23+2+3;<; cx ) 

as we saw above. 

ft therefore follows that (K; <j 2 ox ) and (K 1 , <( ex ) are isomorphic. The construction of L' C {0, 1}+ 
then follows the proof of Lemma O 

Now we obtain, in the same way that we proved Theorem^ the following result. 

Theorem 3. The set of contextfree languages L C {0,1}+ such that (L; <i cx ) is rigid (is rigid 
and scattered, resp.), is H^-hard. 

4 Isomorphisms and automorphisms of linear orders on trees 

In this section, we will show that the isomorphism of scattered and tree automatic linear orders is 
undccidablc. Furthermore, we will prove that the existence of a non-trivial automorphism in this 
case is Z^-hard. Both these results use (an improved version of) a theorem by Krob [22] that we 
discuss first. 

4.1 Weighted automata and Minsky machines 

A weighted automaton is a tuple A = (Q, E, l, /z, F) where Q is the finite set of states, E the 
alphabet, i G Q is the initial state, F G Q is the set of accepting states, and fi : Q x E x Q —> 
{— oo, 0, 1} is the weight function. 

A run of A is a sequence p =)(qo, oi, qi) . . . (qk-i, au, qk) € A+ with q = i, a,, g,) G 

{0, 1}, and qk G F. Its label is the word a\ ...ctk G E + . By Run(.A, w) we denote the set of runs 
labeled w and Run(yl) denotes the set of all runs of A. The weight wt(p) of the run p is the number 
of indices i with jLt(gj_i, ai,qt) = 1. The behaviour \ \A\ \ of .A is the function from E + to NU {— oo} 
that maps the word w to the maximal weight of a run with label w. 



Theorem 4 (cf. proof of [9, Theorem 8.6]). From a Minsky machine M, one can construct 
a weighted automaton A and a regular language CT rcg C [S ■ □)+ such that, for any m G N, the 
following are equivalent: 



1. m is not accepted by M.. 

2. \\A\\{u) > for all u G CT rog with m = max{n | $n(an)" < prc f u}. 



Furthermore, \\A\\(u) G N for all u G CT 



rcg ■ 



Proof. This is a slight adaptation of the proof of [HI Theorem 8.6]. If r is the function defined 
there, we construct the weighted automaton A such that 



\\A\\(u) 



r{a\a% ■ ■ ■ ah) if u = oiDoaD . . . OfcD 
otherwise. 



The reason for this modification is that here, transition weights are from {— oo, 0, 1} while, in [5], 
we also used the transition weight 2. 

From the weighted automaton A, one can then construct (cf. [819] ) weighted automata Am on 
the alphabet S and Bm on the alphabet such that 

\\A M \\(u) = max(L 1 | i J + 1, | \A\ | (uj) and (2) 

\\A\\(u) if x = u<g>$n(an) m ,ue CT rcg , 

and to = max{n | $n(an)" < pre f u} 
\\Bm\\(x) = { \\A M \Kv) if x = u ® $D(aD) m and (3) 

(u ^ CT reg oim/ max{n | $n(an)™ < pre f u}) 
otherwise 



for all u G S + and x G 

For m G N, we define the function rjvi,™ : S + — > N by rM,m( u ) — \\Bm\\(u ® $n(an) m }. 
This is well-defined since, for any u G S + and to G N, we have ||^4||(u) G N and therefore also 
$n(an)" 1 ) G N. In other words, we have 

I ||.4||(u) if u G CT rcg and to — max{n | $n(an) n < pre f u} 
rM, m {u) = < (4) 
I | \Am 1 1 \ u ) otherwise 



Proposition 1. For all to G N, the following are equivalent: 

1. m is not accepted by the Minsky machine A4. 

2. \\Am\\{u) = rM,m(u) holds for all u G S* ■ 



Proof. Suppose that m is not accepted by the Minsky machine M.. Let u G U* \ CT reg orin/ 
max{n | $n(an)™ < pro f u}. Then (@} immediately implies = rM,m{u). Next let u G 

CT rcg and rn = max{n | $n(an)™ < prc f u}. Then, by Theorem [4j we have ||„4||(u) > ^\u\ 
implying \\A\\(u) > [±\u\\ + 1 and therefore ||.A>f||(u) = ||-4||(u) = r M ,m{u) by © and Q. 

For the other direction, assume that to is accepted by the Minsky machine M.. Then, by 
Theorem 01 there exists a word u G CT reg with m — max{n | $n(an)™ < pro f u] such that 
|_4.| | (w) < \\u\. Note that the word u has even length since u G CT rcg . Hence ||^4||(u) < L^MJ +1- 
Consequently ||Am||0) = [\\u\\ + 1 > ||-4||(u) = r M ,m(u) by © and (0}. 



4.2 Isomorphism 

For a function r : U + —> N, we set 



i>€(£+;<iie*) 



Since <u ex ) — w i this linear order is an w-sequence of ordinals, separated by our delimiter 5. 
Hence it is scattered. Furthermore, we obtain 

C r = C r > <^=> r = r' (5) 

for all functions r, r' : S + — > N. 

Lemma 4. From a weighted automaton A, one can compute a regular set of trees La such that 

(Aa! <trocs) — £||.A||- 

Before we prove this lemma, we show how we can use it to prove that the isomorphism problem 
of scattered tree automatic linear orders is undecidable (the proof of Lemma [4] can be found 
following the proof of Corollary [2]) . 

Lemma 5. From a Minsky machine M. and m 6 N, one can compute a regular set of trees L 

such that (L; <trees) — £r M , m - 

Proof. Let A4 be a Minsky machine and let m G N. Let Bm be the weighted automaton constructed 
following Theorem HI Then, from m £ N, we can compute a weighted automaton BM,m with 
alphabet E such that 

\\B M , m \\{u) = \\B M \\{u® $D(an) m ) for all u e S+ . 

But then ||£>.M,m|| = i"M,m- By Lemma HI we can compute, from m G N, a regular language of 
trees L such that (L; <trees) — £\\B M , m \\ = £-r M , m - 

Theorem 5. There is a scattered linear order C such that the set of regular tree languages L with 
(L; <trocs) — C is Il^-hard. 

Proof. Let PCNbe some TJ^-complete set. Then there exists a Minsky machine Ai that accepts 
the set N\ P. Let Am and Bm be the weighted automata constructed following TheoremlH Then 
we get 

m G P <J=^ m is not accepted by M. 

Vt ^\\Am\\(u) = r M . m (u) for all u E U+ (6) 
£\\A M \\ - £>r M , m 

where the last equivalence follows from ([5]). Hence, by Lemma we can reduce the TT^-complete 
set P to the set of regular tree languages L with (L; <t re es) — £\\A M \\- The theorem therefore 
holds with C — C\\a m \\- 

Since the linear order <trees is tree automatic, we immediately obtain 

Corollary 2. There is a scattered linear order C whose set of tree automatic presentations is 
n^-hard. 

One immediately gets that the isomorphism problem for tree automatic scattered linear orders is 
TT^-hard. We do not know whether the set of tree automatic presentations of scattered linear orders 
is decidable. Therefore, the following immediate consequence of Corollary [2] is a bit stronger: 



Corollary 3. Let X be a set of pairs of tree automatic presentations such that, for all tree auto- 
matic presentations Pi and P 2 of scattered linear orders C\ and £2, one has 

(P 1 ,P 2 )eX ^ Ci=C 2 . 

Then X is Pt^-hard. 

The rest of this section is devoted to the proof of Lemma |U 

Proof (Proof of Lemma\^ . Let A = (Q, E, l, /1, F) be a weighted automaton. We will construct a 
tree automatic presentation of the linear order 

A run tree of A is a tree t over the alphabet £ W {$} such that there exist states t = 
goj 3i) • • • , Qk-i £ Q a n d qu G F (with k — max{i | ,+1 G dom(f)}) with the following prop- 
erties: 

(Tl) 11 e dom(t) C 0* U 0*10* U 110* and 100 g dom(£) 
(T2) £(0 4 ) e £ and ^fe-i, t(0 l ), qi) + -00 for all 1 < i < k 
(T3) 0*1 s dom(i) implies 1 < i < k and a,, g») = 1 or i — 

(T4) t~ 1 ($) = dom(t) \ {0 l I 1 < i < k} 

Note that every run tree t defines a word over £, namely 

word(t) =i(0)t(00) ... t(0 k ). 

Since 11 G dom(i), also 1 and therefore belong to dom(£). Hence word(t) ^ e. Fig. [2] shows 
a run tree £ with word(£) = abaab (we omitted the label $ in the figure). The idea is that the 
mam branch" {0, 00, . . . , fc } carries a run p of the weighted automaton A. The number of "side 
branches" starting in some node l l with i > is at most the weight wt(p) of the encoded run. 
Since these side branches have arbitrary length, the whole run tree stands for an element of u; wt ( p ). 
The "side branch" starting in 11 plays a special role, its length |dom(£) D 110 + | is denoted n(£) 
(the run tree £ in Fig. [2] satisfies n(t) = 2). 




Fig. 2. A run tree 

We next define, for two trees s and £, the tree s + t by adding a new $-labeled root and 
considering s as left subtree of s + t and £ as right subtree. More formally, dom(s + £) = {e} U 
0dom(s) U ldom(£), (s + £)(e) = $, (s + t)(0u) = s(u) for u E dom(s), and (s + t)(lv) = t(v) for 
v G dom(£). Since we consider words as special trees, we will meet trees of the form w + 1. These 
trees carry the sequences Sw on dom(u> + £) n 0* and satisfy (w + t) \\= t. 

We now define the language L_a by 



La = {t I £ is a run tree} U {w$ + t\w G £ + ,t G D} 



where D is the set of trees from page 0] that satisfies (D; <t re es) — S. This language is clearly 
regular. 

Note that trees from L^ use the alphabet UU {$} that we order arbitrarily. We will now prove 

(La'i <trees) — £\\A\\ ■ 

First let w £ S + and n £ N. Then let Z^ denote the restriction of (La', <trcos) to all run 
trees £ with 

word(£) = w and n(£) = n . (7) 

Note that for any two run trees s and t satisfying ([7)1. we have sfo* = £|V and s|"i = ^ Pi ■ Hence 
s <trees t if and only if there exists i > 1 with £|"oii = ston f° r all 1 < j < i and £to*i < trees sfon- 
By (T3), dom(£) n + l contains at most \w\ elements. Furthermore note that the trees tfon can 
be identified with natural numbers (namely with dom(£) n 0'10*|). This shows that n can be 
embedded into (N'™'; <i cx ) and is therefore well-ordered and at most lj\ w \. 

Now let p = (qo, a%, qi)(qi, a-i, Q2) ■ ■ ■ (Qk-ii a k:Qk) £ Run(„4, w) be a run of the weighted au- 
tomaton A on the word w — a\ . . . at - For any tuple (mi, . . . , m&) £ N fc such that 

mi > ==> n(qi-i,ai,qi) = 1, 

there exists a unique run tree £ satisfying ([7]) and |dom(£) n 0*10* | = m, for all 1 < i < k. This 
gives an order-preserving embedding f p : cj wt ( p ) — > I^ n , i.e., we showed cj wt ( p ) < Z^ n . Since this 
holds for abitrary runs p £ Kun(A,w) and since ||^4||(w) = max{wt(^>) | p £ Rmyi(A,w)}, we get 
JI-AIKV) < ji n and therefore 

w l|A||(w)+i <x i . w 

^ — -Su,^ w ■ 

By (T2), for every run tree £ satisfying ([7]), there exists at least one run p £ Run(_4, w) such that 
£ is in the image of the embedding f p . Hence 

l l w , n < ^ 

pGR\in(A,w) 

where ^ denotes the natural sum of ordinals. We can conclude 

J>^<zV"< ^ wt(p) ]-^ 

\p£Run(.A,™) / 
max{wt(/9)|/9GRun(^4,uj)}+l 

= J\A\\(w) + l 



and therefore 



X 1 . w = JWK»)+l. 



W.'ll 



Next consider the restriction Z^ of (i^; <trees) to the set of run trees £ with word(£) = w. 
Then n(s) < n(t) implies s <trecs £• Furthermore, the restriction of Zj, to the set of run trees £ 
with n(t) = n equals 1\ n . Hence 

ne(N:<) 

Next consider the restriction Zj of (Z.4; <t ree s) to the set of trees w% + D. Then 1%, = S by 
what we saw on page|H Let s be a run tree with word(s) = w and let £ £ w$ + D. Then s and £ 
coincide on 0* (where they both carry the sequence §w$). Consider sfio» and t Tio* - Since s is a 
run tree, we have dom(s) n 10* = {1, 10} while £f x £ D implies dom(£) n 10* = {1, 10, 100}. Hence 
s\i <trccs £fi an d therefore s <troos t. Hence, the restriction I w of (Z.4; <trees) to the set of run 
trees £ with word(£) = w and the set of trees w$ + D satisfies 



Finally, let u, v G S + . Then u <n cx v if and only if u <trees v. This implies 

(L A ;< tmes )= ]T 2 W = J2 ^ All{w)+1 + 5 = C A . 

4.3 Automorphisms 

From Theorem [5J we already know that the existence of a non-trivial automorphism of a word 
automatic and scattered linear order is Z^-hard. Here, we push this lower bound one level higher 
for tree automatic scattered linear orders. The order theoretic construction resembles that from 
Section [321 but also uses ideas from the previous section. 

Let M. be a Minsky machine, let Am an( i Bm De the weighted automata and, for m G N, let 
r M.m be the function defined following Theorem 2) Then we define the linear order 

C M = ( £ IIAmII ' w * +£r M , m ■ w) . 

mG(N;<) 

Lemma 6. From a Minsky machine M., one can construct a tree automatic presentation of the 
linear order Cm . 

Proof. Let M be a Minsky machine, let Am and Bm be the weighted automata and let rM,m ■ 
S + — > N be the function defined following Theorem [4j Recall that the alphabet of Am is E 
and that of Bm is Recall the notion of a run tree from the proof of Lemma H] that is based 
on a weighted automaton. In this proof, we will consider run trees with respect to the weighted 
automaton Am and with respect to the weighted automaton Bm ■ Now recall the definition of the 
language L Am and L Bm : 

La m = {t | t is a run tree wrt. Am} U {w$ + t \ w G U + , t € D} 
L Bm = {t | t is a run tree wrt. B M } U {w$ + £ | w) G G £>} 

Note that these two tree languages are disjoint since the alphabets E and are disjoint. Now 
define the language 

L M = {La m <E> $* ® $□(«□)*) 

UO®$ t ®fa(an) ra | togN,<gLbm, and 

(i is a run tree word(i) G S + ® $n(an) m )} . 

The crucial point regarding the regularity of this set is the verification that a tree s®$ fe ®$n(an) m 
with t a run tree of Bm belongs to the second set. But this is the case if sfo* = $$n(an) m $, a 
property that a tree automaton can check easily. 

On this set, we define the following linear order ^: (s <g> $ fe (g> $n(an) m ) ^ (t <g> f ® $n(an) n ) 
if and only if one of the following hold 



(01) 


m < n or 


(02) 


m 


= n, 


s G L Am , and i G L Bm , or 


(03) 


m 


= n, 


s, t G iyijv, , and k > £, or 


(04) 


rn 


= n, 


s, t G ^ytjv, , k = £, and s < 


(05) 


rn 


= n, 


s, i £ is^vi j and k < £, or 


(06) 


rn 


= n, 


s, i G Ljj^ , k — £, and s < t 



It is clear that this relation is automatic and it remains to be shown that (L»; ^) = Cm- 

For k, m > let Z,4,fe, m denote the restriction of (Lm', ^) to the set L Am ®% k ® %u(au) m . By 
(04) and Lemma [4] we get 

^A,k,m — £\\A M \\ ■ ( 8 ) 



Next let X_4 i7n denote the restriction of (Lm', ifD to the set Ia m ® $* ® $n(an) m . Then, (03) and 
© imply 

Now we consider the weighted automaton Bm'- For fc,m > let TB,k,m denote the restriction 
of (Lx; dd to the set of all trees t £3>$ fc £3>$D(an) m such that £ e 7^ and, if Ms a run tree of Bm, 
then word(i) G I7 + ® $n(an) m . By (06), Is,k,m is a restriction of (L Bm ; < t rccs) — &b m - Using 
the arguments from the proof of Lemma |4] again, we obtain 

Together with (05), this implies 

where m is the restriction of (Lm) ^) to the set of all trees f £S> % k £S> §u(au) m such that k > 
is arbitrary, i £ 7^ an£ ti if i is a run tree of Bm, then word(i) £ I7 + ® $n(an) m . 

Now, from (02), ((9|) and (fTO| . we obtain that the restriction of (i^vii ^) to the set of trees that 
define Ia,™, and Zg. m is isomorphic to 

^II^aiII ' w * + ' W . 

Finally, (01) implies 

mG(N;<) 
= . 

Theorem 6. The set of tree automatic presentations of rigid (rigid and scattered, resp.) 

linear orders is II 2 -hard, 
(ii) The set of tree automatic presentations of automatically rigid linear orders is H\-complete. 

Proof. (i) As in Section l3~2"l the linear order Cm is rigid if and only if the block 

B(m) = C\\a m \\ ' w * + A-M.m ' u 

is rigid for every m. But the linear order B(m) is rigid if and only if £11.4^11 p= C TM m if 
and only if m is accepted by M (by Theorem 2]). In summary, Cm is rigid if and only if the 
Minsky machine M accepts all numbers, a TTj -complete problem. Hence Lemma [5] reduces a 
T^-complete problem to the set of tree-automatic presentations of rigid (rigid and scattered, 
resp.) linear orders. 

(ii) Hardness follows from Corollary Hfii), containment in 77° can be shown as in the proof of 
Corollary DJii). 



5 Open questions 

The isomorphism and rigidity problems for word automatic scattered linear orders both belong to 
A®, (cf. [23] ). our lower bound 77° for the rigidity problem leaves quite some room for improvements. 
Since the rank of a tree automatic linear order is properly below |17ll6j , the proof of [23] can 
be adapted to show that the isomorphism and the rigidity problems for tree automatic scattered 
linear orders both belong to But we only have the lower bounds 77° and IT®, resp. Finally, 

the rigidity problem for arbitrary word or tree automatic linear orders is in Tl\, but also here, we 
only have the arithmetic lower bound 77" and 77^, resp. 

But the most pressing open question is the isomorphism problem of scattered and word auto- 
matic linear orders. 
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